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Abstract 

In this paper, we investigate the problem of cooperative relay in CR networks for further enhanced 
network performance. In particular, we focus on the two representative cooperative relay strategies, 
and develop optimal spectrum sensing and p-Persistent CSMA for spectrum access. Then, we study 
the problem of cooperative relay in CR networks for video streaming. We incorporate interference 
alignment to allow transmitters collaboratively send encoded signals to all CR users. In the cases of a 
\Q , single licensed channel and multiple licensed channels with channel bonding, we develop an optimal 

distributed algorithm with proven convergence and convergence speed. In the case of multiple channels 
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without channel bonding, we develop a greedy algorithm with bounded performance. 



I. Introduction 

Cooperative relay in CR networks 0], ED represents another new paradigm for wireless 
communications. It allows wireless CR nodes to assist each other in data delivery, with the 
objective of achieving greater reliability and efficiency than each of them could attain individually 
(i.e., to achieve the so-called cooperative diversity). Cooperation among CR nodes enables 
opportunistic use of energy and bandwidth resources in wireless networks, and can deliver many 
salient advantages over conventional point-to-point wireless communications. 

Recently, there has been some interesting work on cooperative relay in CR networks [HJ — [J3]| . 
In flU, the authors considered the case of two single-user links, one primary and one secondary. 
The secondary transmitter is allowed to act as a "transparent" relay for the primary link, motivated 
by the rationale that helping primary users will lead to more transmission opportunities for CR 
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nodes. In [0, the authors presented an excellent overview of several cooperative relay scenarios 
and various related issues. A new MAC protocol was proposed and implemented in a testbed to 
select a spectrum-rich CR node as relay for a CR transmitter/receiver pair. 

We investigate cooperative relay in CR networks, using video as a reference application to 
make the best use of the enhanced network capacity JH. We consider a base station (BS) and 
multiple relay nodes (RN) that collaboratively stream multiple videos to CR users within the 
network. To support high quality video service in such a challenging environment, we assume 
a well planned relay network where the RNs are connected to the BS with high-speed wireline 
links. Therefore the video packets will be available at both the BS and the RNs before their 
scheduled transmission time, thus allowing advanced cooperative transmission techniques to be 
adopted for streaming videos. In particular, we consider interference alignment, where the BS and 
RNs simultaneously transmit encoded signals to all CR users, such that undesired signals will be 
canceled and the desired signal can be decoded at each CR user flU, 0. In 0, such cooperative 
sender-side techniques are termed interference alignment, while receiver-side techniques that use 
overheard (or exchanged via a wireline link) packets to cancel interference is termed interference 
cancelllation. We present a stochastic programming formulation of the problem of interference 
alignment for video streaming in cooperative CR networks and then a reformulation of the 
problem based on Linear Algebra theory [8j, such that the number of variables and computational 
complexity can be greatly reduced. To address the formulated problem, we propose an optimal 
distributed algorithm with proven convergence and convergence rate, and then a greedy algorithm 
with a proven performance bound. 

The remainder of this paper is organized as follows. Related work is discussed in Section HO 
In Section [nil we compare two cooperative relay strategies in CR networks. We investigate 
the problem of cooperative CR relay with interference alignment for MGS video streaming in 
Section [IV] Section |V] concludes the paper. 

II. Background and Related Work 

The theoretical foundation of relay channels was laid by the seminal work ||9). The capac- 
ities of the Gaussian relay channel and certain discrete relay channels are evaluated, and the 
achievable lower bound to the capacity of the general relay channel is established in this work. 
In ifTOi . ifTTll . the authors described the concept of cooperative diversity, where diversity gains 

2 



are achieved via the cooperation of mobile users. In JT2l . the authors developed and analyzed 
low-complexity cooperative diversity protocols. Several cooperative strategies, including AF and 
DF, were described and their performance characterizations were derived in terms of outage 
probabilities. 

In practice, there is a restriction that each node cannot transmit and receive simultaneously 
in the same frequency band. The "cheap" relay channel concept was introduced in |[P3l . where 
the authors derived the capacity of the Gaussian degraded "cheap" relay channel. Multiple relay 
nodes for a transmitter-receiver pair are investigated in lfl4l and JT51 . The authors showed that, 
when compared with complex protocols that involve all relays, the simplified protocol with no 
more than one relay chosen can achieve the same performance. This is the reason why we 
consider single relay in this paper. 

In ifToll . Ng and Yu proposed a utility maximization framework for joint optimization of node, 
relay strategy selection, and power, bandwidth and rate allocation in a cellular network. Cai et 
al. ifTTl presented a semi-distributed algorithm for AF relay networks. A heuristic was adopted to 
select relay and allocate power. Both AF and DF were considered in lfT8l . where a polynomial 
time algorithm for optimal relay selection was developed and proved to be optimal. In lfl9l . 
a protocol is proposed for joint routing, relay selection, and dynamic spectrum allocation for 
multi-hop CR networks, and its performance is evaluated through simulations. 

The problem of video over CR networks has only been studied in a few recent papers J20l - 
J24l . In ETI . a dynamic channel selection scheme was proposed for CR users to transmit videos 
over multiple channels. In ll22ll . a distributed joint routing and spectrum sharing algorithm for 
video streaming over CR ad hoc networks was described and evaluated with simulations. In our 
prior work, we considered video multicast in an infrastructure-based CR network [20J, unicast 
video streaming over multihop CR networks [23 J and CR femtocell networks J25l . In ll24ll . 
the impact of system parameters residing in different network layers are jointly considered to 
achieve the best possible video quality for CR users. Unlike the heuristic approaches in ||2TT| . 
J22l . the analytical and optimization approach taken in this paper yields algorithms with optimal 
or bounded performance. The cooperative relay and interference alignment techniques also 
distinguish this paper from prior work on this topic. 

As point-to-point link capacity approaches the Shannon limit, there has been considerable 
interest on exploiting interference to improve wireless network capacity JU-JU, ll26ll . [|27ll . In 
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addition to information theoretic work on asymptotic capacity Q, O, practical issues have been 
addressed in 0, ll26l . Il27l . In ll26l . the authors presented a practical design of analog network 
coding to exploit interference and allow concurrent transmissions, which does not make any 
synchronization assumptions. In ll27ll . interference alignment and cancellation is incorporated 
in MIMO LANs, and the network capacity is shown, analytically and experimentally, to be 
almost doubled. In [7], the authors presented a general algorithm for identifying interference 
alignment and cancellation opportunities in practical multi-hop mesh networks. The impact of 
synchronization and channel estimation was evaluated through a GNU Radio implementation. 
Our work was motivated by these interesting papers, and we incorporate interference alignment 
in cooperative CR networks and exploit the enhanced capacity for wireless video streaming. 

III. CR and Cooperative Networking 

In this section, we investigate the problem of cooperative relay in CR networks [3J. We 
assume a primary network with multiple licensed bands and a CR network consisting of multiple 
cooperative relay links. Each cooperative relay link consists of a CR transmitter, a CR relay, and 
a CR receiver. The objective is to develop effective mechanisms to integrate these two wireless 
communication technologies, and to provide an analysis for the comparison of two representative 
cooperative relay strategies, i.e., decode-and-forward (DF) and amplify-and-forward (AF), in the 
context of CR networks. We first consider cooperative spectrum sensing by the CR nodes. We 
model both types of sensing errors, i.e., miss detection and false alarm, and derive the optimal 
value for the sensing threshold. Next, we incorporate DF and AF into the p-Persistent Carrier 
Sense Multiple Access (CSMA) protocol for channel access for the CR nodes. We develop 
closed-form expressions for the network-wide capacities achieved by DF and AF, respectively, 
as well as that for the case of direct link transmission for comparison purpose. 

Through analytical and simulation evaluations of DF and AF-based cooperative relay strategies, 
we find the analysis provides upper bounds for the simulated results, which are reasonably tight. 
We also find cross-point with the AF and DF curves when some system parameter is varied, 
indicating that each of them performs better in a certain parameter range. There is no case that 
one completely dominates the other for the two strategies. The considerable gaps between the 
cooperative relay results and the direct link results exemplify the diversity gain achieved by 
cooperative relays in CR networks. 
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CR transmitter, relay, and receiver licensed channels 




Fig. 1. Illustration of colocated primary and CR networks. The CR network consists of a number cooperative relay links, each 
consisting of a CR transmitter, a CR relay and a CR receiver. 

A. Network Model and Assumptions 

We assume a primary network and a spectrum band that is divided into M licensed chan- 
nels, each modeled as a time slotted, block-fading channel. The state of each channel evolves 
independently following a discrete time Markov process. 

As illustrated in Fig. \T\ there is a CR network colocated with the primary network. The CR 
network consists of N sets of cooperative relay links, each including a CR transmitter, a CR 
relay, and a CR receiver. Each CR node (or, secondary user) is equipped with two transceivers, 
each incorporating a software defined radio (SDR) that is able to tune to any of the M licensed 
channels and a control channel and operate from there. 

We assume CR nodes access the licensed channels following the same time slot structure 11281 . 
In the sensing phase, a CR node chooses one of the M channels to sense using one of its 
transceivers, and then exchanges sensed channel information with other CR nodes using the 
other transceiver over the control channel. During the transmission phase, the CR transmitter 
and/or relay transmit data frames on licensed channels that are believed to be idle based on 
sensing results, using one or both of the transceivers. We consider cooperative relay strategies 
AF and DF, and compare their performance in the following sections. 
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B. Cooperative Relay in CR Networks 

In this section, we investigate how to effectively integrate the two advance wireless communi- 
cation technologies, and present an analysis of the cooperative relay strategies in CR networks. 
We first examine cooperative spectrum sensing and derive the optimal sensing threshold. We 
then consider cooperative relay and spectrum access, and derive the network-wide throughput 
performance achievable when these two technologies are integrated. 

1) Spectrum Sensing : We assume there are N m CR nodes sensing channel m. After the 
sensing phase, each CR node obtains a sensing result vector m = [0™, Q™, ■ ■ ■ , 0^ m ] for 
channel m. The conditional probability a m (0 m ) on channel m availability is 
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If a m (Q m ) is greater than a sensing threshold r m , channel m is believed to be idle; otherwise 
channel m is believed to be busy. The decision variable D m is defined as follows. 

0, if a m (9 m ) > r m 

1, if fl m (O m ) ^ ^rrt- 

CR nodes only attempt to access channel m where D m is 0. Since function a m (0 m ) in © has 
N m binary variables, there can be 2 Nm different combinations corresponding to 2 Nm values for 
a m (0 m ). We sort the 2 Nm combinations according to their a m (0 m ) values in the non-increasing 
order. Let o™ be the jth largest function value and 9$ the argument that achieves the jth largest 
function value oSl where 
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In the design of CR networks, we consider two objectives: (i) how to avoid harmful interference 
to primary users, and (ii) how to fully exploit spectrum opportunities for the CR nodes. For 
primary user protection, we limit the collision probability with primary user with a threshold. Let 
7 m be the tolerance threshold, i.e., the maximum allowable interference probability with primary 
users on channel m. The probability of collision with primary users on channel m is given as 
Pr{/J m = 0|iJ™}; the probability of detecting an available transmission opportunity is Pr{D m = 
| H™}. Our objective is to maximize the probability of detecting available channels, while 
keeping the collision probability below 7to . Therefore, the optimal spectrum sensing problem 
can be formulated as follows. 

max Pr{D m = 0|# m } (3) 
subect to: Pr{D m = 0\H™} < 7m . (4) 

From their definitions, both Pr{D m = 1 H™} and Pi{D m = 1 H™} are decreasing functions 
of r m . As Pr{D m = 1 H™)} approaches its maximum allowed value 7 m , Pr{D m = 1 H™} also 
approaches its maximum. Therefore, solving the optimization problem © ~ © is equivalent 
to solving 

Pr{An = | H?} = 7m . 

(i) 

If T m = dm , we have 

Pr{£> m = 0|#r}(a m >) = Pr{a m (9 m ) > a$\H?} 

= £pr{a m (e m ) = aSlflT) = Dff^(l - a*™- (5) 
i=i i=i 

Obviously, Pr{D m = | H™}(a£) is an increasing function of j. The optimal sensing threshold 

(i) 

r m can be set to a™ , such that 

Pr{D m = 0\H™}(a£)< lm 

and 

Pr{D rn = Q\H™}(a^) > 7m . 
The algorithm for computing the optimal sensing threshold r m is presented in Table HI 
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TABLE I 

Algorithm for Computing the Optimal Sensing Threshold 



1: Compute a!J and the corresponding 9m , 
for all j; 

2: Initialize p c = Pr{a m (O m ) = dm'l-ffi™} and 

T -a (1) - 
3: Set j = 1; 
4: WHILE (p c < 7m ) 

5: 3=3 + 1; 

6- r -o W) - 

7: p c = p c + Pr{a m (6 m ) = a$t\H?}; 

8: END WHILE 



Once the optimal sensing threshold r* n is determined, Pr{D m = | H™} can be computed as 
given in © and Pi{D m = | i?™} can be computed as: 

Pr{D m = 0|# m } = Pr{a m (6 m ) > t*JH™} 

= £pr{a m (e m ) = og W} = J>rF (0 (l - eD^O- (6) 

2J Cooperative Relay Strategies : During the transmission phase, CR transmitters and relays 
attempt to send data through the channels that are believed to be idle. We assume fixed length 
for all the data frames. Let G\ and G\ denote the path gains from the transmitter to relay and 
from the relay to receiver, respectively, and let of fc and <j\ k denote the noise powers at the relay 
and receiver, respectively, for the A;th cooperative relay link. We examine the two cooperation 
relay strategies DF and AF in the following. For comparison purpose, we also consider direct 
link transmission below. 

a) Decode-and-Forward (DF): With DF, the CR transmitter and relay transmit separately 
on consecutive odd and event time slots: the CR transmitter sends data to the corresponding 
relay in an odd time slot; the relay node then decodes the data and forwards it to the receiver 
in the following even time slot, as shown in Fig. [2l 

Without loss of generality, we assume a data frame can be successfully decoded if the received 
signal-to-noise ratio (SNR) is no less than a decoding threshold k. We assume gains on different 
links are independent to each other. The receiver can successfully decode the frame if it is not 
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Odd time slot 



Even time slot 



DF: 



Channel 1 
Channel 2 
Channel 3 
Channel 4 



Busy 


Ri ■> D, 


Si Ri 


Busy 


S 2 -> R 2 


R 2 ~> D 2 


Busy 


Idle 



AF: 



Channel 1 
Channel 2 
Channel 3 
Channel 4 





S 2 -> R 2 


Si -> Ri 




R, ^ Di 


R 2 -> D 2 




Idle 



Fig. 2. Illustration of the protocol operation of AF and DF, where Si => Ri represents the transmission from source to relay 
and Ri Di represents the transmission from relay to destination, for the ith cooperative relay link. 



lost or corrupted on both links. The decoding rate of DF at the A;th receiver, denoted by Pp F , 
can be computed as, 

P k DF = Fx{(P s G\/al k >K) and [P r G\la\ k > k) } 

= (a 2 r>k K/P s ) (crl k K/P r ) , (7) 

where P s and P r are the transmit powers at the transmitter and relay, respectively, F G k (x) and 
F G k (x) are the complementary cumulative distribution functions (CCDF) of path gains G\ and 
G\, respectively. 

b) Amplify -and-Forward (AF): With AF, the CR transmitter and relay transmit simulta- 
neously in the same time slot on different channels. A pipeline is formed connecting the CR 
transmitter to the relay and then to the receiver; the relay amplifies the received signal and 
immediately forwards it to the receiver in the same time slot, as shown in Fig. [2l Recall that the 
CR relay has two transceivers. The relay receives data from the transmitter using one transceiver 
operating on one or more idle channels; it forwards the data simultaneously to the receiver using 
the other transceiver operating on one or more different idle channels. 

With this cooperative relay strategy, a data frame can be successfully decoded if the SNR at 
the receiver is no less than the decoding threshold k. Then the decoding rate of AF at the A;th 
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receiver, denoted as P\ F , can be computed as 

PAF - VX W- <, < k - K f-Ja M ) dF ^ (xl 

(8) 

c) Direct Link Transmission: For comparison purpose, we also consider the case of direct 
link transmission (DL). That is, the CR transmitter transmits to the receiver via the direct link; 
the CR relay is not used in this case. Let the path gain be Gq with CCDF F G k(x), and recall 
that the noise power is (j\ k at the receiver, for the A;th direct link transmission. 

Following similar analysis, the decoding rate of DL at the A;th receiver, denoted as P^ L , can 
be computed as 

P k DL = Pr{P s G k /al k >K}=F Gko (al k K/P s ). (9) 

3) Opportunistic Channel Access : We assume greedy transmitters that always have data 
to send. The CR nodes use p-Persistent CSMA for channel access. At the beginning of the 
transmission phase of an odd time slot, CR transmitters send Request-to-Send (RTS) with 
probability p over the control channel. Since there are iV CR transmitters, the transmission 
probability p is set to 1/N to maximize the throughput (i.e., to maximize Pi in (fTOb given 
below). 

The following three cases may occur: 

• Case 1: none of the CR transmitters sends RTS for channel access. The idle licensed 
channels will be wasted. 

• Case 2: only one CR transmitter sends RTS, and it successfully receives Clear-to-Send 
(CTS) from the receiver over the control channel. It then accesses some of or all the licensed 
channels that are believed to be idle for data transmission in the transmission phase. 

• Case 3: more than one CR transmitters send RTS and collision occurs on the control channel. 
No CR node can access the licensed channels, and the idle licensed channels will be wasted. 

Let P , Pi and P 2 denote the probability corresponding to the three cases enumerated above, 
respectively. We then have 

P = (l-p) N = (1-1/N) N (10) 
P 1 = Npil-p)"- 1 = (l-l/Nf- 1 (11) 
P 2 = 1-Pq-Pl (12) 
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The CR cooperative relay link that wins the channels in the odd time slot will continue to use 
the channels in the following even time slot. A new round of channel competition will start in 
the next odd time slot following these two time slots. 

Since a licensed channel is accessed with probability P\ in the odd time slot, we modify the 
tolerance threshold 7 m as 7^ = 7 m /Pi, such that the maximum allowable collision requirement 
can still be satisfied. In the even time slot, the channels will continue to be used by the winning 
cooperative relay link, i.e., to be accessed with probability 1. Therefore, the tolerance threshold 
is still 7 m for the even time slots. 

4) Capacity Analysis : Once the CR transmitter wins the competition, as indicated by a 
received CTS, it begins to send data over the licensed channels that are inferred to be idle 
(i.e., D m = 0) in the transmission phase. We assume the channel bonding and aggregation 
technique is used, such that multiple channels can be used collectively by a CR node for data 
transmission ll29Tl. Il30l. 

With DF, the winning CR transmitter uses all the available channels to transmit to the relay in 
the odd time slot. In the following even time slot, the CR transmitter stops transmission, while 
the relay uses the available channels in the even time slot to forward data to the receiver. If the 
number of available channels in the even time slot is equal to or greater than that in the odd time 
slot, the relay uses the same number of channels to forward all the received data. Otherwise, 
the relay uses all the available channels to forward part of the received data; the excess data 
will be dropped due to limited channel resource in the even time slot. The dropped data will be 
retransmitted in some future odd time slot by the transmitter. 

With AF, no matter it is an odd or even time slot, the CR transmitter always uses half of 
the available licensed channels to transmit to the relay. The relay uses one of its transceivers 
to receive from the chosen half of the available channels. Simultaneously, it uses the other 
transceiver to forward the received data to the receiver using the remaining half of the available 
channels. 

Let and be the decision variables of channel m in the odd and even time slot, 
respectively (see ©). Let and be the status of channel m in the odd and even time slot, 
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respectively. We have, 
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where Pr{S£f = j} are the probabilities that channel m is busy or idle, Pi{S^ = I \ — j} 
are the channel m transition probabilities. Pi{D^ = k \ = 1} and Pi{D^ = i \ = j} 
can be computed as in (0) and ©. 

Let Ndf, Naf and Ndl be the number of frames successfully delivered to the receiver in 
the two consecutive time slots using DF, AF and DL, respectively. Define = 1 — S!^, 
S™ = 1 - S™, D°J = 1- D°J and Df? = 1 - Df?. We have 
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EM riod rjod 
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(14) 
(15) 
(16) 



where x A y represents the minimum of x and y, and [xj means the maximum integer that is 
not larger than x. 

As discussed, the probability that a frame can be successfully delivered is P^ F , Pj| F , or P£ L 
for the three schemes, respectively. Recall that spectrum resources are allocated distributedly for 
every pair of two consecutive time slots. We derive the capacity for the three cooperative relay 
strategies as 

C DF = E[N DF ]-j:^ =1 (P^ F P l L)/(2NT s ) (17) 
Cap = E[N AF }-ELi( p 1fPiL)/(2NT s ) (18) 
C DL = E[N DL }-J2ti( P D L PiL)/(2NT s ), (19) 

where L is the packet length and T s is the duration of a time slot. The expectations are computed 
using the results derived in <fT3T ) ~ <fT6l) . 
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TABLE II 

Simulation Parameters and Values 



Symbol 


Value 


Definition 


M 


5 


number of licensed channels 


X 


0.7 


channel transition probability 






from idle to idle 


/' 


0.2 


channel transition probability 






from busy to idle 


'/ 


0.6 


channel utilization 


7 


0.08 


maximum allowable collision 






probability 


N 


7 


number of CR cooperative relay 






links 


Ps 


10 dBm 


transmit power of the CR 






transmitters 


Pr 


10 dBm 


transmit power of CR relays 


L 


1 kb 


packet length 


T s 


1 ms 


duration of a time slot 



C. Performance Evaluation 

We evaluate the performance of the cooperative relay strategies with analysis and simulations. 
The analytical capacities of the schemes are obtained with the analysis presented in Section HTl-B I 
The actual throughput is obtained using MATLAB simulations. The simulation parameters and 
their values are listed in Table HI] unless specified otherwise. We consider M — 5 licensed 
channels and a CR network with seven cooperative relay links. The channels have identical 
parameters for the Markov chain models. Each point in the simulation curves is the average 
of 10 simulation runs with different random seeds. We plot 95% confidence intervals for the 
simulation results, which are negligible in all the cases. 

We first examine the impact of the number of licensed channels. To illustrate the effect of 
spectrum sensing, we let the decoding rate P\ F be equal to P^f- I n Fig- El we plot the throughput 
of AF, DF, and DL under increased number of licensed channels. The analytical curves are upper 
bounds for the simulation curves in all the cases, and the gap between the two is reasonably 
small. Furthermore, as the number of license channels is increased, the throughput of both AF 
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5 6 7 
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Fig. 3. Throughput performance versus number of licensed channels. 



and DF are increased. The slope of the AF curves is larger than that of the DF curves. There 
is a cross point between five and six, as predicted by both simulation and analysis curves. This 
indicates that AF outperforms DF when the number of channels is large. This is because AF is 
more flexible than DF in exploiting the idle channels in the two consecutive time slots. The DL 
analysis and simulation curves also increases with the number of channels, but with the lowest 
slope and the lowest throughput values. 

In Fig. HI we demonstrate the impact of channel utilization on the throughput of the schemes. 
The channel utilization r] is increased from 0.3 to 0.9, when primary users get more active. As 
i] is increased, the transmission opportunities for CR nodes are reduced and all the throughputs 
are degraded. We find the throughputs of AF and DF are close to each other when the channel 
utilization is high. AF outperforms DF in the low channel utilization region, but is inferior to 
DF in the high channel utilization region. There is a cross point between the AF and DF curves 
between r] = 0.5 and r] = 0.6. When the channel utilization is low, there is a big gap between 
the cooperative relay curves and and the DL curves. 

In Fig. [5l we examine the channel fading factor. We consider Rayleigh block fading channels, 
where the received power is exponentially distributed with a distance-dependent mean. We fix 
the transmitter power at 10 dBm, and increase the relay power from one dBm to 18 dBm. As 
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0.3 0.4 0.5 0.6 0.7 0.8 0.9 

Channel Utilization (r\) 



Fig. 4. Throughput performance versus primary user channel utilization. 



the relay power is increased, the throughput is also increased since the SNR at the receiver is 
improved. We can see the increasing speed of AF is larger than that of DF, indicating that AF 
has superior performance than DF when the relay transmit power is large. The capacity analysis 
also demonstrate the same trend. The throughput of DL does not depend on the relay node. 
Its throughput is better than that of AF and DF when the relay transmit power is low, since 
both AF and DF are limited by the relay-to-receiver link in this low power region. However, 
the throughputs of AF and DF quickly exceed that of DL and grow fast as the relay-to-receiver 
link is improved with the increased relay transmit power. The considerable gaps between the 
cooperative relay link curves and the DL curves in Figs. [3l H] and [5] exemplify the diversity gain 
achieved by cooperative relays in CR networks. 

IV. Cooperative CR Networks with Interference Alignment 

In this section, we investigate cooperative relay in CR networks using video as a reference 
application. We consider a base station (BS) and multiple relay nodes (RN) that collaboratively 
stream multiple videos to CR users within the network. It has been shown that the performance 
of a cooperative relay link is mainly limited by two factors: 

• the half-duplex operation, since the BS-RN and the RN-user transmissions cannot be 
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Fig. 5. Throughput performance versus transmit power of relay nodes. 



scheduled simultaneously on the same channel [10]; and 

• the bottleneck channel, which is usually the BS-user and/or the RN-user channel, usually 
with poor quality due to obstacles, attenuation, multipath propagation and mobility [fT2ll . 

To support high quality video service in such a challenging environment, we assume a well 
planned relay network where the RNs are connected to the BS with high-speed wireline links, and 
explore interference alignment to overcome the bottleneck channel problem H|. Therefore the 
video packets will be available at both the BS and the RNs before their scheduled transmission 
time, thus allowing advanced cooperative transmission techniques (e.g. interference alignment) 
to be adopted for streaming videos. In particular, we incorporate interference alignment to allow 
transmitters collaboratively send encoded signals to all CR users, such that undesired signals 
will be canceled and the desired signal can be decoded at each CR user. 

We present a stochastic programming formulation, as well as a reformulation that greatly 
reduces computational complexity. In the cases of a single licensed channel and multiple li- 
censed channels with channel bonding, we develop an optimal distributed algorithm with proven 
convergence and convergence speed. In the case of multiple channels without channel bonding, 
we develop a greedy algorithm with a proven performance bound. 
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-"'control channel 



Fig. 6. Illustration of the cooperative CR network. 

A. Network Model and Assumptions 

The cooperative CR network is illustrated in Fig. [6j There is a CR BS (indexed 1) and 
(K — 1) CR RNs (indexed from 2 to K) deployed in the area to serve N active CR users. 
Let U = {1, 2, ■ • • , N} denote the set of active CR users. We assume that the BS and all the 
RNs are equipped with multiple transceivers: one is tuned to the common control channel and 
the others are used to sense multiple licensed channels at the beginning of each time slot, and 
to transmit encoded signals to CR users. We consider the case where each CR user has one 
software defined radio (SDR) based transceiver, which can be tuned to operate on any of the 
(M+l) channels. If the channel bonding/aggregation techniques are used ll29l . OTI . a transmitter 
can collectively use all the available channels and a CR user can receive from all the available 
channels simultaneously. Otherwise, only one licensed channel will be used by a transmitter and 
a CR user can only receive from a single chosen channel at a time. 

Consider the three channels in a traditional cooperative relay link. Usually the BS and RNs 
are mounted on high towers, and the BS-RN channel has good quality due to line-of-sight 
(LOS) communications and absence of mobility. On the other hand, a CR user is typically on 
the ground level. The BS-user and RN-user channels usually have much poorer quality due to 
obstacles, attenuation, multipath propagation and mobility. To support high quality video service, 
we assume a well planned relay network, where the RNs are connected to the BS via broadband 
wireline connections (e.g., as in femtocell networks G51 ). Alternatively, free space optical links 
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can also be used to provide multi-gigabit rates between the BS and the RNs [|32ll . As a result, 
the video packets will always be available for transmission (with suitable channel coding and 
retransmission) at the RNs at their scheduled transmission time. To cope with the much poorer 
BS-user and RN-user channels, the BS and RNs adopt interference alignment to cooperatively 
transmit video packets to CR users, while exploiting the spectrum opportunities in the licensed 
channels. 

1 ) Spectrum Access: The BS and the RNs sense the licensed channels and exchange their sens- 
ing results over the common control channel during the sensing phase. Given L sensing results 
obtained for channel m, the corresponding sensing result vector is Q™ = [6™, Q™, • • • , ]. Let 
Pm(®L ) := Pm(®T> ©2\ " " > @l ) be the conditional probability that channel m is available, 
which can be computed iteratively as shown in our prior work ll25l : 



Vm (^)i-er (i _ 5T) e? 
1 H x 



*£(ep) := ^(er.esv.-.e 



I I 



1 + 



1 



m\l-er/i rm\6™ 1 - 1 



(spy-®! (i - s, 



Y v 1 ' 1 1 ' \ I > O 



For each channel m, define an index variable D m {t) for the BS or RNs to access the channel 
in time slot t. That is, 

{0, access channel m in time slot t 
m = 1,2,- ..,M. (20) 
1, otherwise, 

With sensing result P^(0^), each channel m will be opportunistically accessed. Let the 
probability be P^(6^) that channel m will be accessed in time slot t (i.e., when D m {t) = 0). 
The optimal channel access probability can be computed as: 

P£(6?) = min { 7m / [l - P£@?j\ , l} . (21) 

Let ^4(t) be the set of available channels in time slot t. It follows that A(t) := {m \ D m {t) = 0}. 

2) Interference Alignment : We next briefly describe the main idea of interference alignment 
considered in this paper. Interested readers are referred to 0, [|27ll for insightful examples, a 
classification of various interference alignment scenarios, and practical considerations. 

Consider two transmitters (denoted as s\ and S2 ) and two receivers (denoted as d\ and c^)- 
Let X\ and X 2 be the signals corresponding to the packets to be sent to d\ and d 2 , respectively. 
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With interference alignment, the transmitters si and s 2 send compound signals a^iXi + a\^ 2 X 2 
and a 2 ^Xi + a 2j2 X 2 , respectively, to the two receivers d\ and d 2 simultaneously. If channel noise 
is ignored, the received signals Y\ and Y 2 can be written as: 



G T x A x X, 



(22) 



Y 2 G 2 \ Cj2,2 a 2,l o 22 X 2 

where G^j is the channel gain from transmitter Sj to receiver dj. 

From (|22l) . it can be seen that both receivers can perfectly decode their signals if the transfor- 
mation matrix A is chosen to be {G T } , i.e., the inverse of the channel gain matrix. With this 
technique, the transmitters are able to send packets simultaneously and the interference between 
the two concurrent transmissions can be effectively canceled at both receivers 0. 



B. Problem Formulation 

We formulate the problem of interference alignment for scalable video streaming over cooper- 
ative CR networks in this section. As discussed in Section IIII-AL the video packets are available 
at both the BS and all the RNs before their scheduled transmission time; the BS and RNs adopt 
interference alignment to overcome the poor BS-CR user and RN-CR user channels. 

Let Xj denote the signal to be transmitted to user j, which has unit power. As illustrated in 
Section ITV-A2[ transmitter k sends a compound signal J2jeu a k,jXj to all active CR users, where 
a k /s are the weights to be determined. Ignoring channel noise, we can compute the received 
signal Y n at a user n as: 

K N K N 

,jG kn Xj 

k=l j=l k=l j=l 

N K 

= y^Xj-y^ a kJ G k , n , n = 1, 2, • • • , N, (23) 

3=1 k=l 

where G k:U is the channel gain from the BS (i.e., k — 1) or an RN k to user n. For user n, only 
signal X n should be decoded and the coefficients of all other signals should be forced to zero. 
The zero-forcing constraints can be written as: 

K 

a k,j G k,n = 0, for all j ^ n. (24) 

k=l 
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Usually the total transmit power of the BS and every RN is limited by a peak power P max . 
Since X, has unit power, for all j, the power of each transmitted signal is the square sum of all 
the coefficients a\ .. The peak power constraint can be written as 

N 

} J K,j| 2 < Pmax, k = 1, • • • , K. (25) 
J'=l 

Recall that each CR user has one SDR transceiver that can be tuned to receive from any of the 
(M + 1) channels, when channel bonding is not adopted. Let W 1 be a binary variable indicating 
that user j selects licensed channel m. It is defined as 

{1, if user n receives from channel m 
j = l,...,N,m = l,---,M. (26) 
0, otherwise, 

Then, we have the following transceiver constraint: 

bf<l, j = l,-.-,N. (27) 

After introducing the channel selection variables 5™'s, the overall channel gain becomes 

meA(t) 

where ifP. is the channel gain from the BS (i.e., k = 1) or an RN k to user j on channel m. 

Let u>* be the PSNR of user j's reconstructed video at the beginning of time slot t and Wj 
the PSNR of user j's reconstructed video at the end of time slot t. In time slot t, u>j is already 
known, while W* is a random variable depending on the resource allocation and primary user 
activity during the time slot. That is, is a realization of Wj. 

The quality of reconstructed MGS video can be modeled with a linear equation 11331 : 

W(R) = a + /3xR } (29) 

where W(R) is the average peak signal-to-noise ratio (PSNR) of the reconstructed MGS video, 
R is the average data rate, and a and (3 are constants depending on the specific video sequence 
and codec. 

We formulate a multistage stochastic programming problem to maximize the sum of expected 
logarithm of the PSNR's at the end of the GOP, i.e., J2f=i ^ D°g(Wj r )] > f° r proportional fairness 
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among the video sessions 11341 . It can be shown that the multistage stochastic programming 
problem can be decomposed into T serial sub-problems, one for each time slot t, as 112311 : 

N 

maximize: J^E [log(Wj)|«;}] (30) 

subject to: Wj = w) + (31) 
bf E {0, 1}, a kJ > 0, for all m,j, k (32) 
Constraints (ED), ([25]) and (|2"7T> . 

where vp*. is a random variable that depends on spectrum sensing, power allocation, and channel 
selection in time slot t. This is a mixed integer nonlinear programming problem (MINLP), with 
binary variables 6™'s and continuous real variables a^/s. 

In particular, can have two possible values: (i) zero, if the packet is not successfully 
received due to collision with primary users; (ii) the PSNR increase achieved in time slot t if 
the packet is successfully received, denoted as A*. The PSNR increase can be computed as: 



!og 2 I 1 + ^ a k,jG kt ^j J , (33) 



where N is the noise power and B is the channel bandwidth. 

User j can successfully receive a video packet from channel m if it tunes to channel m (i.e., 
b™ = 1) and the BS and RNs transmit on channel m (i.e., with probability P®{®™)). The 
probability that user j successfully receives a video packet, denoted as Pj, is 

p ]= E b 7 P °(®L)- (34) 

Therefore, we can expand the expectation in (l30l) to obtain a reformulated problem: 

TV 

maximize: ^ E [Pj log(w* + A*-) + (1 - P$) log(w*)] (35) 
subject to: constraints ((241), (|251> , (|27T ), and (|32l) . 

C. Solution Algorithms 

In this section, we develop effective solution algorithms to the stochastic programming prob- 
lem (|30l) . In Section HV-C1[ we first consider the case of a single licensed channel, and derive a 
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distributed, optimal algorithm with guaranteed convergence and bounded convergence speed. We 
then address the case of multiple licensed channels. If channel bonding/aggregation techniques 
are used 112911 , PTL the distributed algorithm in Section [IV-Cll can still be applied to achieve 
optimal solutions. We finally consider the case of multiple licensed channels without channel 
bonding, and develop a greedy algorithm with a performance lower bound in Section IIV-C3I 
1) Case of a Single Channel : 

a) Property : Consider the case when there is only one licensed channel, i.e., when M — 1. 
The K transmitters, including the BS and (K — 1) RNs, send video packets to active users using 
the licensed channel when it is sensed idle. 

Definition 1. A set of vectors is linearly independent if none of them can be written as a linear 
combination of the other vectors in the set /(#]/. 

— # 

For user j, the weight and channel gain vectors are: dj = [aij,a2,j, ■ ■ ■ , clkJ\ and Gj = 
[Gij, G 2 j, ■ ■ ■ ,G K j] T , where T denotes matrix transpose. Due to spatial diversity, we assume 
that the Gj vectors are linearly independent ||5l. 

Lemma 1. To successfully decode each signal Xj, j = 1, 2, • • • , N, the number of active users 
N should be smaller than or equal to the number of transmitters K. 

Proof: From (|24j) . it can be seen that a, is orthogonal to the (N — 1) vectors G n 's, for 
n 7^ j. Since Sj is a K by 1 vector, there are at most (K — 1) vectors that are orthogonal to aj. 
Since the Gj vectors are linearly independent, it follows that (N — 1) < (K — 1) and therefore 



According to Lemma [H the following additional constraints should be enforced for the channel 
selection variables. 



That is, the number of active users receiving from any channel m cannot be more than the 
number of transmitters on that channel, which is K in the single channel case and less than or 
equal to K in the multiple channels case. We first assume that iV is not greater than K, and 
will remove this assumption in the following subsection. 



N < K. 



N 




for all m e A(t). 



(36) 
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b) Reformulation and Complexity Reduction : With a single channel, all active users 
receive from channel 1. Therefore &j = 1, and bj 1 — 0, for m > 1, j = 1,2, ■ • • , N. The 
formulated problem is now reduced to a nonlinear programming problem with constraints (|24l) . 
(1251) . and d32l) . If the number of active users is N — 1, the solution is straightforward: all the 
transmitters send the same signal to the single user using their maximum transmit power 
p 

1 max- 

In general, the reduced problem can be solved with the dual decomposition technique ll35l 
(i.e., a primal dual algorithm). This problem has K x N primal variables (i.e., the a k /s), and 
we need to define iV(iV — 1) dual variables (or, Lagrangian Multipliers) for constraints (|24)) and 
K dual variables for constraints (1251) . These numbers could be large for even moderate-sized 
systems. Before presenting the solution algorithm, we first derive a reformulation of the original 
problem (1351) that can greatly reduce the number of primal and dual variables, such that the 
computational complexity can be reduced. 

Lemma 2. Each vector Sj = [ai.j, a2j, ■ ■ • , CbK,j] T can be represented by the linear combination 
of r nonzero, linearly independent vectors, where r = K — iV + 1. 

Proof: From (|24l) . each vector Sj is orthogonal to G{ where j ^ i. Define a reduced matrix 
G_j obtained by deleting Gj from G, i.e., G_j = [G\, ■ ■ ■ , Gj-i, Gj+i, • • • , G N }. Then ctj is 
a solution to the homogeneous linear system Gl.-a; = 0. Since we assume that the Gj's are 
all linearly independent, the columns of G_j are also linearly independent 10. Thus the rank 
of G_j is (N — 1). The solution belongs to the null space of G_j. The dimension of the null 
space is r = K — (N — 1) according to the Rank-nullity Theorem [8J. Therefore, each Ej can 
be presented by the linear combination of r linearly independent vectors. ■ 

Let Bj = {e*/ i, e J>2 , • • • , e J>r .} be a basis for the null space of G_j. There are many methods 
to obtain the basis, such as Gaussian Elimination. However, we show that it is not necessary 
to solve the homogeneous linear system G J _-x = to get the basis for every different j value. 
Therefore the computational complexity can be further reduced. 

Our algorithm for computing a basis is shown in Table Hill In Steps 1-6, we first solve the 
homogeneous linear system G T x = to get a basis [vi, v 2 , ■ ■ ■ , vk-n}- Note that if K is equal 
to N, the basis is the empty set 0. We then set the K — N basis vectors to be the first K — iV 
vectors in all the basses ej, j — 1, 2, • • • , N. In Step 8, we orthogonalize each G_j and obtain 
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TABLE III 
Basis Computation Algorithm 



1: IF (K > N) 

2: Solve homogeneous linear system G T x — and get 

basis [«!,••• , vk-n]; 
3: FOR i = 1 to K - N 

4: ej t i = Vi, for all j; 

5: END FOR 

6: END IF 

7: FOR j = 1 to N 

8: Orthogonalize G-j and get (A — 1) orthogonal vectors Wj,i's; 

9: Calculate ej )V as in ( |37t ; 

10: END FOR 



(N — 1) orthogonal vectors Q 3 f i , i = 1, 2, ■ ■ • ,N — 1. Finally in Step 9, we let the rth vector 
e Jjr be orthogonal to all the tUj/s by subtracting all the projections on each ujj^ from Gj (recall 
that r = K — N + 1). The operation is: 

Zj,r = £j,N-K+l = Gj — 2_. ->j - ' (37) 

i=l u j,i u i,i 

Lemma 3. The solution space constructed by the basis [vi, V2, ■ ■ ■ , vk-n] is a sub-space of the 
solution space of GLjX = for all j. 

Proof: It is easy to see that each vector Vi is a solution of Gl^x = by substituting x with 
vu for i = 1,2, ••• ,K-N. ■ 

Lemma 4. The vectors [v*i, V2, ■ • • , vk-n, e^r] computed in Table\IT7\is a basis of the null space 
of G-j. 



Proof: Obviously, the Vi's are linearly independent. From (|37l) . it is easy to verify that e j r 
is orthogonal to all the Uj/s. Therefore, e^ r is also a solution to system GZjX = 0. Since Gj 
and tijj are orthogonal to all the t^'s, and e*j ir is a linear combination of Gj and ujj^, Cj^ r is also 
orthogonal and linearly independent to all the Vi's. The conclusion follows. ■ 
Define coefficients c*, = [c^i, Cj^, ■ ■ ■ , Cj jr ] T . Then we can represent dj as a linear combination 
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TABLE IV 

Comparison of Computational Complexity 





Original Problem 


Reformulated Problem 


Primal Variables 


KN 


(K-N + 1)N 


Dual Variables 


N(N -1)+K 


K 



j T 



of the basis vectors, i.e., dj = Y7i=i c j,i^j,i = e jCj- Eq. (T3~3l can be rewritten as 

0iB lo g 2 (i + w (^) 2 ) = ^r log2 { 1 + w M^) 2 ) • (38) 

The second equality is because the first K — N column vectors in are orthogonal to Gj. 
The random variable Wj in the objective function now only depends on c 3>r . The peak power 
constraint can be revised as: 

N 

Y^MQEj] 2 < P max , k = 1, ■ • • , K, (39) 

3=1 

where ej(k) is the kth row of matrix e^. 

With such a reformulation, the number of primal and dual variables can be greatly reduced. 
In Table [IV] we show the numbers of variables in the original problem and in the reformulated 
problem. The number of primary variables is reduced from KN to (K — N + 1)N, and the 
number of dual variables is reduced from iV(iV — 1) + K to K. Such reductions result in greatly 
reduced computational complexity. 

c) Distributed Algorithm : To solve the reformulated problem, we define non-negative dual 
variables jl = • • • ,/i^] T for the inequality constraints. The Lagrangian function is 

N K N 

£(c,/Z) = ^E[log(^( Cj g)|^.]+^ /Ufc (P ma ,-^[e J (A ; )c J ] 2 ) 

3=1 k=l j=l 

N K 

= y^CjjCj,^ + Pmax 22 ^ k ' ( 40 ) 
j=l k=l 

where c is a matrix consisting of all column vector c/s and 

K 

Cj^ft) = E [log(Wj(c i>P ))m] -J2^[e3(k)c 3 ] 2 . 

k=l 

The corresponding problem can be decomposed into iV sub-problems and solved iteratively 
Il3~51 . In Step r > 1, for given vector //(r), each CR user solves the following sub-problem using 
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local information 

Cj(r) = arg max £j(cj, /?(r)). (41) 

Obviously, the objective function in (I4TI) is concave. Therefore, there is a unique optimal solution. 
The CR users then exchange their solutions over the common control channel. To solve the primal 
problem, we adopt the gradient method [|351l . 

c,(r + 1) = Cjif) + 0VA(c,(t),m(t)), (42) 

where VCj(cj(r), (2(t)) is the gradient of the primal problem and is a small positive step size. 
The master dual problem for a given c(r) is: 

N K 

min ^ = 5^ /Z) + V n k . (43) 

i=l fe=l 

Since the Lagrangian function is differentiable, the subgradient iteration method can be adopted. 

fl(T + l) = [fI(r)-p(T)g(r)] + , (44) 

where p(r) = ^rS^^ - is a positive step size, j2* is the optimal solution, g(r) = Vq(j2(r)) 
is the gradient of the dual problem, and [-] + denotes the projection onto the nonnegative axis. 
Since the optimal solution jT is unknown a priori, we choose the mean of the objective values 
of the primal and dual problems as an estimate for ft* in the algorithm. The updated /ifc(r + 1) 
will again be used to solve the sub-problems (l4Tj) . Since the problem is convex, we have strong 
duality; the duality gap between the primal and dual problems will be zero. The distributed 
algorithm is shown in Table |Vl where < k 1 is a threshold for convergence. 

d) Performance Analysis : We analyze the performance of the distributed algorithm in this 
section. In particular, we prove that it converges to the optimal solution at a speed faster than 
y/T/r as r goes to infinity. 

Theorem 1. The series g(/2(r)) converges to q(fl*) as r goes to infinity and the square error 
sum Y^°=i( ( l(fi( T )) ~ q(fi*)) 2 is bounded. 
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TABLE V 

Algorithm for the Case of a Single Channel 

1: IF (N = 1) 

2: Set a k ,j to P max for all k; 

3: ELSE 

4: Set r = 0; /2(0) to positive values; c(0) to random values; 

5: Compute bases e/s as in Table [TTT1 

6: DO 

7: t = r + 1 ; 

8: Compute Cj(r) as in d42t ; 

9: Broadcast Cj(r) on the common control channel; 

10: Update /2(r) as in ( 144k 

11: WHILE (\\P(t) — P(t — 1)|| > k); 

12: Compute a^j's; 

13: END IF 



Proof: For the optimality gap, we have: 

||/2(r + 1) - /T|| 2 = ||[/2(t) - P (r)g(r)] + - /2*|| 2 
< ||/X(r)-p(rHr)-/ri| 2 

= Mr) - p*\\ 2 - 2 P (t)(/2(t) - /T) T s» + (p(r)) 2 | |£(t) 
= Mr) - /T|| 2 - 2p(r)(g(^(r)) - g(/T)) + (p(r)) 2 | \g(r) 
Since the step size is p(r) = ^fe^f^ , it follows that 



1 1 2 1 1 / _\ — 1 1 2 



||/l(T + l)-/f|| 2 < Mr)-/! 



(g(/2(r)) - g(/T)) 2 
1 1^)1 1 2 

< 11/2(7-)- /HI 2 - W^)-^)) 2 , (45) 

g 2 

where g 2 is an upper bound of ||(2(t)|| 2 . Since the second term on the right-hand-side of (1451) is 
non-negative, it follows that lim^oo g(/2(r)) = g(/2*). 
Summing Inequality (1451) over r, we have 



D#))-^))^ 9 2 \\m -/ri 



T = l 



That is, the square error sum is upper bounded. 
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Theorem 2. The sequence {q{jl{j))} converges faster than {1/a/t} as r goes to infinity. 

Proof: Assume lim^oo y^r(g(/7(r)) — q(jZ*)) > 0. Then there is a sufficiently large r' and 
a positive number £ such that y/r(q(p,(r)) — q(fi*)) > £, for all r > r'. Taking the square sum 
from r' to oo, we have: 

oo oo 

E(gWr)) - qmf >ej2- = ™- ( 46 ) 

t=t' t=t' 

Eq. (l46l) contradicts with Theorem [T} which states that X]^=i(q , (/^( t )) ~ q(fi*)) 2 * s bounded. 
Therefore, we have 

lim q M T) ] = 0, (47) 

indicating that the convergence speed of q(fi(r)) is faster than that of l/y/r. ■ 

2) Case of Multiple Channels with Channel Bonding : When there are multiple licensed 
channels, we first consider the case where the channel bonding/aggregation techniques are used 
by the transmitters and CR users [|29ll , OTTl . With channel bonding, a transmitter can utilize all 
the available channels in A(t) collectively to transmit the mixed signal. We assume that at the 
end of the sensing phase in each time slot, CR users tune their SDR transceiver to the common 
control channel to receive the set of available channels A(t) from the BS. Then each CR user 
can receive from all the channels in A(t) and decode its desired signal from the compound 
signal it receives. 

This case is similar to the case of a single licensed channel. Now all the active CR users receive 
from the set of available channels A(t). We thus have 6™ = 1, for m 6 A(t), and b™ = 0, for 
m ^ A(t), j = 1, 2, • • • , N. When all the 6™'s are determined this way, problem (l30l) is reduced 
to a nonlinear programming problem with constraints (124)) and (|25l) . The distributed algorithm 
described in Section ITV-C 1 1 can be applied to solve this reduced problem to get optimal solutions. 

3 ) Case of Multiple Channels without Channel Bonding : We finally consider the case of 
multiple channels without channel bonding, where each CR user has a narrow band SDR 
transceiver and can only receive from one of the channels. We first present a greedy algorithm 
that leverages the optimal algorithm in Table |V] for near-optimal solutions, and then derive a 
lower bound for its performance. 
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TABLE VI 

Channel Selection Algorithm for the Case of Multiple Channels without Channel Bonding 





Initialize b to a zero vector, user set U = {1, • • ■ 




1: 






and user-channel set C = U x .A(<); 




2: 


WHILE (C / 0) 




3: 


Find the user-channel pair {j',m'}, such that 






{j ,m. } = argmax{(j, m ) eC }{«(6 + Uj )- 


-$(6)}; 


4: 


Set 6 = 6 + £>y and remove j' from W; 




5: 


if (Ef =1 &r' = *o 




6: 


Remove m' from .A(t); 




7: 


END IF 




8: 


Update user-channel set C = U x .A(t); 




9: 


END WHILE 





a) Greedy Algorithm : When M > 1, the optimal solution to problem (1301 ) depends also on 
the binary variables 6™'s, which determines whether user j receives from channel m. Recall that 
there are two constraints for the 6™'s: (i) each user can use at most one channel (see ([27]) '): (ii) 
the number of users on the same channel cannot exceed the number of transmitters K (see (1361)). 
Let b be the channel allocation vector with elements 6™'s, and the corresponding objective 
value for a given user channel allocation b. 

We take a two-step approach to solve problem (1301 . First, we apply the greedy algorithm 
in Table [VI] to choose one available channel in A(t) for each CR user (i.e., to determine b). 
Second, we apply the algorithm in Table |V] to obtain a near-optimal solution for the given channel 
allocation b. 

In Table |VT1 v™ is a unit vector with 1 for the [(j — 1) x M + m)-th element and for all 
other elements, and b — b + v"f indicates choosing channel m! for user j'. In each iteration, the 
user-channel pair (j', m!) that can achieve the largest increase in the objective value is chosen, 
as in Step 3. The complexity of the greedy algorithm in the worst case is 0(K 2 M 2 ). 

b) Performance Bound : We next analyze the greedy algorithm and derive a lower bound 
for its performance. Let v\ be the sequence from the first to the Zth user-channel pair selected 
by the greedy algorithm. The increase in objective value is denoted as: 

F x := F{v h = $00 - $0_i). (48) 
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Sum up (EH) from 1 to L. We have Ya=i f i = $ 

(u L ) since $(z/ ) = 0. Let be the global 
optimal solution for user-channel allocation. Define iti as a subset of fi. For given v u iti is the 
subset of user-channel pairs that cannot be allocated due to the conflict with the /-th user channel 
allocation vi (but not conflict with the user-channel allocations in z^_i). 

Lemma 5. Assume the greedy algorithm stops in L steps, we have 

L 

$(fi) < $k) + Yl F ( a u ^-i> ^-i)- 

Proof: The proof is similar to the proof of Lemma 7 in E51 and is omitted for brevity. ■ 

Theorem 3. The greedy algorithm for channel selection in Table WL can achieve an objective 
value that is at least nm\\ of the global optimum in each time slot. 

Proof: According to Lemma [5J it follows that: 

L L 

$(fi) < $(v L )+J2Wi\Fi < <f>(v L ) + (\A(t)\-l)J2 F i = \A{t)M"L). (49) 
i=i i=i 

The second inequality is due to the fact that each user can choose at most one channel and there 
are at most (|^4(t)| — 1) pairs in tx\ according to the definition. The equality in (|49l) is because 
Yla=i Ft — Then we have: 

1 



! l(t) |$(fi)<$K)<$(fi). (50) 

The greedy heuristic solution is lower bounded by l/|^4(t)| of the global optimum. ■ 
Define competitive ratio % = < ^( Z7 l)/ c &(^) = Vl^(^)l- Assume all the licensed channels have 
identical utilization r\. Since \ A(t) \ is a random variable, we take the expectation of x an d obtain: 

M /1\ 

E[X] = V M + ( - ) - 1)" ^ 

n=l 



In Fig. |7l we evaluate the impact of channel utilization r/ and the number of licensed channels 
M on the competitive ratio. We increase t] from 0.05 to 0.95 in steps of 0.05 and increase M 
from 6 to 12 in steps of 2. The lower bound (|50l ) becomes tighter when r] is larger or when M is 
smaller. For example, when r\ = 0.6 and M — 6, the greedy algorithm solution is guaranteed to 
be no less than 52.7% of the global optimal, when r] is increased to 0.95, the greedy algorithm 
solution is guaranteed to be no less than 98.3% of the global optimal. 
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Channel Utilization (r\) 

Fig. 7. Competitive ratio E[x] defined in l |51t versus channel utilization r\. 

D. Performance Evaluation 

We evaluate the performance of the proposed algorithms with a MATLAB implementation and 
the JVSM 9.13 Video Codec. We present simulation results for the following two scenarios: (i) 
a single licensed channel and (ii) multiple licensed channels without channel bonding, since we 
observe similar performance for the case of multiple licensed channels with channel bonding. For 
comparison purpose, we also developed two simpler heuristic schemes that do not incorporate 
interference alignment. 

• Heuristic 1: each CR user selects the best channel in A(t) based on channel condition. The 
time slot is equally divided among the active users receiving from the same channel, to 
send their signals separately in each time slice. 

• Heuristic 2: in each time slot, the active user with the best channel is selected for each 
available channel. The entire time slot is used to transmit this user's signal. 

1 ) Case of a Single Licensed Channel: In the first scenario, there are K = 4 transmitters, 
i.e., one BS and three RNs. The channel utilization i] is set to 0.6 and the maximum allowable 
collision probability 7 is set to 0.2. There are three active CR users, each receives an MGS 
video stream from the BS: Bus to CR user 1, Mobile to CR user 2, and Harbor to CR user 3. 
The video sequences are in the Common Intermediate Format (CIF, 252x288). The GOP size 
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User Index 



Fig. 8. Received video quality for each CR user with a single channel. 



of the videos is 16 and the delivery deadline T is 10. The false alarm probability is ef 1 = 0.3 



and the miss detection probability is 5™ = 0.3 for all spectrum sensors. The channel bandwidth 
B is 1 MHz. The peak power limit is 10 W for all the transmitters, unless otherwise specified. 

We first plot the average Y-PSNRs of the three reconstructed MGS videos in Fig. [8l i.e., 
only the Y (Luminance) component of the original and reconstructed videos are used. Among 
three schemes, the proposed algorithm achieves the highest PSNR value, while the two heuristic 
algorithms have similar performance. Note that the proposed algorithm is optimal in the sin- 
gle channel case. It achieves significant improvements ranging from 3.1 dB to 5.25 dB over 
the two heuristic algorithms. Such PSNR gains are considerable, since in video coding and 
communications, a half dB gain is distinguishable and worth pursing. 

We next examine the convergence rate of the distributed algorithm. According to Theorem [2l 
the distributed algorithm converges at a speed faster than 1 / yfr asymptotically. We compare the 
optimality gap of the proposed algorithm, i.e., \q(r) — q*\, with series 10/y/r in Fig. [9j Both 
curves converge to as r goes to infinity. It can be seen that the convergence speed, i.e., the 
slope of the curve, of the proposed scheme is larger than that of 10/y/r after about 10 iterations. 
The convergence of the optimality gap is much faster than 10 / \fr, which exhibits a heavy tail. 

In the case of multiple channels with channel bonding, the performance of the proposed 
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Fig. 9. Convergence rate of the distributed algorithm with a single channel. 



algorithm is similar to that in the single channel case. We omit the results for lack of space. 

2 ) Case of Multiple Channels without Channel Bonding: We next investigate the second 
scenario with six licensed channels and four transmitters. There are 12 CR users, each streaming 
one of the three different videos Bus, Mobile, and Harbor. The rest of the parameters are the same 
as those in the single channel case, unless otherwise specified. Eq. (|49l) can also be interpreted 
as an upper bound on the global optimal, i.e., $(fi) < \A(t)\Q(v L ), which is also plotted in the 
figures. Each point in the following figures is the average of 10 simulation runs with different 
random seeds. The 95% confidence intervals are plotted as error bars, which are generally 
negligible. 

The impact of channel utilization 77 on received video quality is presented in Fig. [TOl We 
increase rj from 0.3 to 0.9 in steps of 0.15, and plot the Y-PSNRs of reconstructed videos 
averaged over all the 12 CR users. Intuitively, a smaller rj allows more transmission opportunities 
for CR users, thus allowing the CR users to achieve higher video rates and better video quality. 
This is shown in the figure, in which all four curves decrease as i] is increased. We also observe 
that the gap between the upper bound and proposed schemes becomes smaller as 77 gets larger, 
from 32.65 dB when rj = 0.3 to 0.63 dB when 77 = 0.9. This trend is also demonstrated in 
Fig. [7J The proposed scheme outperforms the two heuristic schemes with considerable gains, 
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Fig. 10. Reconstructed video quality vs. channel utilization r\ in the multi-channel without channel bonding case. 



ranging from 0.8 dB to 3.65 dB. 

Finally, we investigate the impact of the number of transmitters K on the video quality. In 
this simulation we increase K from 2 to 6 with step size 1. The average Y-PSNRs of all the 
12 CR users are plotted in Fig. [TT] As expected, the more transmitters, the more effective the 
interference alignment technique, and thus the better the video quality. The proposed algorithm 
achieves gains ranging from 1.78 dB (when K = 2) to 4.55 dB (when K = 6) over the two 
heuristic schemes. 

V. Conclusions 

In this paper, we first studied the problem of cooperative relay in CR networks. We modeled the 
two cooperative relay strategies, i.e., DF and AF, which are integrated with p-Persistent CSMA. 
We analyzed their throughput performance and compared them under various parameter ranges. 
Cross-point with the AF and DF curves are found when some parameter is varied, indicating 
that each of them performs better in a certain parameter range; there is no case of dominance 
for the two strategies. Considerable gains were observed over conventional DL transmissions, 
as achieved by exploiting cooperative diversity with the cooperative relays in CR networks. 

Then, we investigated the problem of interference alignment for MGS video streaming in 
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Fig. 11. Reconstructed video quality vs. number of transmitters K in the multi-channel without channel bonding case. 



a cooperative relay enhanced CR network. We presented a stochastic programming formation, 
and derived a reformulation that leads to considerable reduction in computational complexity. 
A distributed optimal algorithm was developed for the case of a single channel and the case of 
multi-channel with channel bonding, with proven convergence and convergence speed. We also 
presented a greedy algorithm for the multi-channel without channel bonding case, with a proven 
performance bound. The proposed algorithms are evaluated with simulations and are shown to 
outperform two heuristic schemes without interference alignment with considerable gains. 
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